ABSTRACT


GENERALIZED  ORDER  STATISTICS,  BAHADUR  REPRESENTATIONS, 

AND  SEQUENTIAL  NONPARAMETRIC  FIXED-WIDTH  CONFIDENCE  INTERVALS 

Let $X_1,\ldots,X_n$ be an i.i.d. sample from df $F$, let $H_F$ be the df of
$h(X_1,\ldots,X_m)$, based on a given "kernel" $h(x_1,\ldots,x_m)$, and consider
confidence interval estimation of a parameter of the form $H_F^{-1}(p)$.
This paper introduces confidence intervals formed by a pair of
"generalized order statistics," develops Bahadur-type representation
theory for these order statistics, and constructs corresponding
sequential fixed-width confidence interval procedures. Previous work
of Bahadur (1966) and Geertsema (1970) is sharpened and extended.
1. Introduction. In this paper we introduce a notion of generalized
order statistics, develop relevant asymptotic theory, and apply the
results to characterize the convergence properties of a class of
sequential nonparametric fixed-width confidence interval procedures.
Previous work of Bahadur (1966) and Geertsema (1970) is broadly extended,
in close connection with ideas introduced in Serfling (1984).

Let $X_1,\ldots,X_n$ be independent random variables having common
distribution function (df) $F$. (More generally, the $X_i$'s may be random
elements of an arbitrary space.) Let $h$ be a function from $\mathbb{R}^m$ to $\mathbb{R}$
and denote by $H_F$ the df of $h(X_1,\ldots,X_m)$. Estimation of parameters of
$F$ which are expressible as $T(H_F)$, where $T(\cdot)$ is a general form of
L-functional, has been considered by Serfling (1984) and Janssen,
Serfling, and Veraverbeke (1984). Here we confine attention to the
special case of quantile L-functionals and hence to parameters of the
form $H_F^{-1}(p)$, $0 < p < 1$, and we investigate nonparametric confidence
intervals formed by a pair of the "generalized order statistics"


(1.1)  $W_{n,1} \le W_{n,2} \le \cdots \le W_{n,n_{(m)}}$,

the ordered values of $h(X_{i_1},\ldots,X_{i_m})$ taken over the
$n_{(m)} = n(n-1)\cdots(n-m+1)$ m-tuples $(i_1,\ldots,i_m)$ of distinct elements
from $\{1,\ldots,n\}$. For any such parameter $H_F^{-1}(p)$, the relevant
sequential confidence interval will be given by

(1.2)  $(W_{N,a(N)},\ W_{N,b(N)})$,


where the "rank functions" $a(n)$ and $b(n)$ are selected so that (1.2)
has specified asymptotic coverage probability for $N = n$ (nonrandom) $\to \infty$,
and where $N$ is a random sample size selected for (1.2) to have specified
fixed width.


For the case $h(x) = x$, (1.1) gives the usual order statistics of
the sample and (1.2) represents a sequential version of the classical
method of giving a nonparametric distribution-free confidence interval for
a quantile $F^{-1}(p)$. For $p = 1/2$, this sequential approach and also the
one based on $h(x_1,x_2) = (x_1 + x_2)/2$ were introduced and investigated
by Geertsema (1970) as competing approaches, in the case of symmetric
df $F$, for estimation of the location parameter $H_F^{-1}(1/2) = F^{-1}(1/2)$. In
particular, with $N = N(d)$ designed for (1.2) to have width $2d$, and
with $a(n), b(n)$ designed to yield asymptotic coverage probability $1 - 2\alpha$,
Geertsema characterized the convergence rate of $N(d)$ to $\infty$ and
established the convergence of the coverage probability of (1.2) to
$1 - 2\alpha$, as $d \to 0$.

In the present paper we consider the behavior of the sequential
confidence interval (1.2) and the random sample size $N(d)$ for the
general case of an arbitrary "kernel" $h(x_1,\ldots,x_m)$. This generality
entails the complication, fortuitously absent in the two special cases
treated by Geertsema, that the functions $a(n), b(n)$ used in (1.2) may
be random. Consequently, it becomes necessary to extend the
representation theorem of Bahadur (1966) for central order statistics
$X_{n,k_n}$, where $k_n/n \to p$, $0 < p < 1$, not only to the case of our
generalized order statistics $W_{n,k_n}$, where $k_n/n_{(m)} \to p$, $0 < p < 1$,
but also to the case that $k_n$ is random.

In Section 2 we provide some convergence results on the empirical
df $H_n$ and quantile function $H_n^{-1}$ associated with the $W_{n,k}$'s, and we
use these results to define appropriate rank functions $a(n), b(n)$ for
use in (1.2). This empirical df is defined by

(1.3)  $H_n(y) = n_{(m)}^{-1} \sum 1\{h(X_{i_1},\ldots,X_{i_m}) \le y\}$,  $-\infty < y < \infty$,

where the sum is taken over the $n_{(m)}$ m-tuples $(i_1,\ldots,i_m)$ of distinct
elements from $\{1,\ldots,n\}$. Clearly, $H_n(y)$ is an unbiased estimator of
$H_F(y)$, and $H_n^{-1}(p)$ provides an estimator of $H_F^{-1}(p)$ analogous to the
usual sample quantile as estimator of $F^{-1}(p)$.
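
To make the definitions (1.1) and (1.3) concrete, the following minimal Python sketch computes the generalized order statistics, the empirical df $H_n$, and the quantile $H_n^{-1}(p)$ by brute-force enumeration of all $n_{(m)}$ ordered m-tuples. The function names and the rank convention $H_n^{-1}(p) = W_{n,k}$ with $k = \lceil p\, n_{(m)} \rceil$ are illustrative assumptions, not notation taken from the paper.

```python
import math
from itertools import permutations


def generalized_order_statistics(sample, h, m):
    """Sorted values of h over all ordered m-tuples of distinct indices, as in (1.1)."""
    return sorted(h(*(sample[i] for i in idx))
                  for idx in permutations(range(len(sample)), m))


def empirical_df(w, y):
    """H_n(y) as in (1.3): fraction of kernel values that are <= y."""
    return sum(1 for v in w if v <= y) / len(w)


def empirical_quantile(w, p):
    """Assumed convention: H_n^{-1}(p) = W_{n,k} with k = ceil(p * n_(m))."""
    k = max(1, math.ceil(p * len(w)))
    return w[k - 1]


# Pairwise-average kernel (m = 2) on a toy sample of size n = 3.
w = generalized_order_statistics([1.0, 2.0, 4.0], lambda a, b: (a + b) / 2, 2)
hn_at_2_5 = empirical_df(w, 2.5)        # 4 of the 6 kernel values are <= 2.5
median_hat = empirical_quantile(w, 0.5)
```

The enumeration costs on the order of $n^m$ kernel evaluations, so this sketch is only practical for small $n$ and $m$.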

Our extended Bahadur representation for $W_{n,k_n}$ and related results
are developed in Section 3. As special cases, we obtain the results
of Bahadur (1966) for the case $h(x) = x$ and $k_n$ nonrandom and of
Geertsema (1970) for the case $h(x_1,x_2) = (x_1 + x_2)/2$ and $k_n$ nonrandom,
under relaxations of their regularity conditions on $F$. The results of
Section 3 and in part Section 2 are of general interest, besides their
applications in this paper.


Section  4  carries  out  general  application  to  the  class  of  sequential 
nonparametric  confidence  intervals  of  form  (1.2).  The  two  examples 
treated  by  Geertsema  (1970)  are  obtained  as  special  cases,  but  under 
weaker  regularity  conditions  on  F. 

The random $a(n), b(n)$ used in defining (1.2) are constructed in
terms of an estimator for a parameter appearing in the asymptotic
distribution of the random variable $H_n(H_F^{-1}(p))$. It is necessary for
our theory in Section 4 that the estimator be strongly consistent.
Such an estimator is developed in Section 5.

We  conclude  this  introduction  with  selected  examples  to  which  the 
methods  of  Section  4  may  be  applied. 

(i) location estimation: One may view the cases considered by
Geertsema (1970) as two special cases of the class of kernels given
by $h(x_1,\ldots,x_m) = (x_1 + \cdots + x_m)/m$, for $m = 1,2,3,\ldots$. For symmetric $F$
the corresponding parameters $H_F^{-1}(1/2)$ all reduce to $F^{-1}(1/2)$, so that the
corresponding estimators given by $H_n^{-1}(1/2)$ are competitors for the
same goal. A comparative study of these estimators for $m = 1,\ldots,5$
has been carried out in Choudhury (1984), on the basis of which a
particular choice of $m$ may be selected and the results of Section 4
utilized to provide associated sequential fixed-width confidence
intervals for $F^{-1}(1/2)$.

More generally, let us consider the kernel $h(x_1,\ldots,x_m) = \sum_{i=1}^m a_i x_i$
with $\sum_{i=1}^m a_i = 1$ (but the $a_i$'s otherwise unrestricted). For symmetric $F$
the corresponding parameter $H_F^{-1}(1/2)$ reduces in each case to $F^{-1}(1/2)$,
but the corresponding estimators $H_n^{-1}(1/2)$ differ and therefore are
competitors. The case $m = 2$ was introduced by Maritz, Wu and Staudte
(1977) and studied as a special case of the class of $M_2$-estimators of
Huber (1964), by switching to the closely related estimators $H_{F_n}^{-1}(1/2)$,
where $F_n$ is the usual sample df. They established asymptotic normality
and examined asymptotic relative efficiencies, among other aspects.
However, by noting that the statistics $H_n^{-1}(1/2)$ are special cases of
the generalized L-statistics of Serfling (1984), we can treat them
directly and obtain not only the relevant asymptotic convergence
theory but also (by our Section 4) associated sequential fixed-width
confidence intervals. From the numerical studies of Maritz, Wu and
Staudte (1977) for the case $m = 2$ and $a_1$ arbitrary, and of Choudhury
(1984) for the case $a_1 = \cdots = a_m = 1/m$, with $m$ arbitrary, it is
found that the classical median and Hodges-Lehmann estimators can be
outperformed in various situations by the estimators $H_n^{-1}(1/2)$
corresponding to choices of $a_1$ even outside the interval
$[0,1]$ and choices of $m > 2$. It would be of interest to extend these
two previous studies to the case of arbitrary $m, a_1,\ldots,a_m$ subject to
$\sum_{i=1}^m a_i = 1$.

(ii) spread estimation: Included among various measures of
spread discussed by Bickel and Lehmann (1979) is the median of the
distribution of $|X_1 - X_2|$, where $X_1, X_2$ are independent r.v.'s having
df $F$. In our context, this is a "generalized L-functional" parameter
$H_F^{-1}(1/2)$, where $H_F$ is based on the kernel $h(x_1,x_2) = |x_1 - x_2|$. This
can be estimated by the generalized L-statistic $H_n^{-1}(1/2)$, and sequential
fixed-width confidence intervals can be developed by our Section 4
results. (It would be of interest to consider a general class of
spread measures of this type, defined by $H_F^{-1}(1/2)$ with $H_F$ based on a
kernel of form $h(x_1,\ldots,x_m) = |\sum_{i=1}^m \beta_i x_i|$, where $\sum_{i=1}^m \beta_i = 0$.)

(iii) regression slope estimation: Consider the simple linear
regression model $Y_i = \alpha + \beta X_i + e_i$, with $\{e_i\}$ i.i.d. r.v.'s independent
of $\{X_i\}$, and $\{X_i\}$ a sequence of random regressors. Let $F$ denote the
common df of the mutually independent pairs $(X_i, Y_i)$, $1 \le i \le n$, and
let $H_F$ denote the cdf of $h((X_1,Y_1),(X_2,Y_2))$, where
$h((x_1,y_1),(x_2,y_2)) = (y_2 - y_1)/(x_2 - x_1)$. Then clearly a natural
estimator of the parameter $\beta$ is the median of the ratios
$(Y_i - Y_j)/(X_i - X_j)$, i.e., the estimator $\hat\beta = H_n^{-1}(1/2)$. This is a version
(for random regressors) of the well-known estimator of Theil (1950).
Using the results of our Section 4, we can provide sequential
fixed-width confidence intervals associated with this estimator.
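
The three examples differ only in the kernel $h$ passed to the same quantile computation. The sketch below illustrates this with hypothetical helper names and toy data; the rank convention $k = \lceil p\, n_{(m)} \rceil$ for $H_n^{-1}(p)$ is an assumption of the sketch, and the slope kernel assumes distinct regressor values.

```python
import math
from itertools import permutations


def kernel_quantile(data, h, m, p=0.5):
    """H_n^{-1}(p) for kernel h, using the assumed rank k = ceil(p * n_(m))."""
    values = sorted(h(*(data[i] for i in idx))
                    for idx in permutations(range(len(data)), m))
    return values[max(1, math.ceil(p * len(values))) - 1]


def pair_average(x1, x2):          # example (i) with m = 2 (Hodges-Lehmann type)
    return (x1 + x2) / 2


def abs_difference(x1, x2):        # example (ii)
    return abs(x1 - x2)


def pair_slope(pt1, pt2):          # example (iii); assumes distinct x-values
    (x1, y1), (x2, y2) = pt1, pt2
    return (y2 - y1) / (x2 - x1)


xs = [0.0, 1.0, 2.0, 3.0, 5.0]
points = [(x, 1.0 + 2.0 * x) for x in xs]      # noiseless line with slope 2

location = kernel_quantile(xs, pair_average, 2)
spread = kernel_quantile(xs, abs_difference, 2)
slope = kernel_quantile(points, pair_slope, 2)
```

On the noiseless toy line every pairwise slope equals 2, so the median of slopes recovers the slope exactly; with noisy data the same call gives the Theil-type estimate.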


2. Convergence results for $H_n$ and $H_n^{-1}$ and other preliminaries.
We note that, for each fixed $y$, $H_n(y)$ is a U-statistic based on the
kernel

$g_y(x_1,\ldots,x_m) = 1\{h(x_1,\ldots,x_m) \le y\}$,  $(x_1,\ldots,x_m) \in \mathbb{R}^m$.

Consequently, by standard results on U-statistics (e.g., Serfling
(1980), Chapter 5), we have strong convergence and asymptotic
normality:

(2.1)  $H_n(y) \xrightarrow{a.s.} H_F(y)$,  $n \to \infty$,

and

(2.2)  $n^{1/2}(H_n(y) - H_F(y)) \xrightarrow{d} N(0, m^2 \sigma_y^2)$,

where $\sigma_y^2 = \mathrm{Var}_F\{g_{y1}(X)\}$, with
$g_{y1}(x) = E\{\sum_P g_y(X_{i_1},\ldots,X_{i_m})/m! \mid X_1 = x\}$, where $\sum_P$ denotes
summation over all permutations $(i_1,\ldots,i_m)$ of $(1,\ldots,m)$.

We shall be applying (2.2) with $y = H_F^{-1}(p)$, in which case a key
parameter of concern will be

(2.3)  $\zeta_p = \sigma^2_{H_F^{-1}(p)}$.

Other convenient notation will be $\xi_p = H_F^{-1}(p)$ and $\hat\xi_{pn} = H_n^{-1}(p)$.

By (2.1) and an argument similar to the proof of strong
convergence of the classical sample quantile (e.g., Serfling (1980),
§2.3), we obtain strong convergence of $\hat\xi_{pn}$,

(2.4)  $\hat\xi_{pn} \xrightarrow{a.s.} \xi_p$,

under the condition that $\xi_p$ is the unique solution $y$ of
$H_F(y-) \le p \le H_F(y)$. Also, by Serfling (1984), we have asymptotic
normality of $\hat\xi_{pn}$,

(2.5)  $n^{1/2}(\hat\xi_{pn} - \xi_p) \xrightarrow{d} N(0, m^2 \zeta_p / h_F^2(\xi_p))$,

where it is assumed that $H_F$ has density $h_F$ positive at $\xi_p$.

One could use (2.5) as a basis for construction of confidence
intervals for $\xi_p$, but this would entail estimation of both $\zeta_p$ and
$h_F(\xi_p)$. Our approach based on the generalized order statistics (1.1)
eliminates estimation of the latter parameter.

Let us now formulate the rank functions $a(n), b(n)$ used in
defining the interval (1.2). First, we note that for integer $k_n$ we
have

(2.6)  $P\{W_{n,k_n} \le \xi_p\} = P\{H_n(\xi_p) \ge k_n/n_{(m)}\}$
       $= P\{n^{1/2}(H_n(\xi_p) - H_F(\xi_p)) \ge n^{1/2}(k_n/n_{(m)} - H_F(\xi_p))\}$.

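If $k_n$ is defined by (with $\Phi$ denoting the standard normal cdf)

(2.7)  $\dfrac{k_n}{n_{(m)}} = p + \dfrac{\Phi^{-1}(1-\alpha)\, m\, \zeta_p^{1/2}}{n^{1/2}} + o(n^{-1/2})$,  $n \to \infty$,

then by (2.2) and (2.6) it follows that

(2.8)  $P\{W_{n,k_n} \le \xi_p\} \to \alpha$,  $n \to \infty$.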


Moreover, (2.8) remains true if $\zeta_p$ is replaced by a consistent
estimator $\hat\zeta_{pn}$ in (2.7). By similar arguments, if $\zeta_p^{1/2}$ is replaced by
$-\zeta_p^{1/2}$ or $-\hat\zeta_{pn}^{1/2}$ in (2.7), then

(2.9)  $P\{W_{n,k_n} > \xi_p\} \to \alpha$,  $n \to \infty$.


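On this basis we define integers $a(n), b(n)$ by

(2.10)  $\dfrac{a(n)}{n_{(m)}} = p - \dfrac{\Phi^{-1}(1-\alpha)\, m\, \zeta_p^{1/2}}{n^{1/2}} + o_p(n^{-1/2})$

and

(2.11)  $\dfrac{b(n)}{n_{(m)}} = p + \dfrac{\Phi^{-1}(1-\alpha)\, m\, \zeta_p^{1/2}}{n^{1/2}} + o_p(n^{-1/2})$

and assert that

(2.12)  $P\{(W_{n,a(n)}, W_{n,b(n)})\ \text{contains}\ \xi_p\} \to 1 - 2\alpha$,  $n \to \infty$.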


In practice $\zeta_p$ is unknown and must be estimated (consistently) to
obtain $a(n), b(n)$ satisfying (2.10), (2.11). Moreover, for the
sequential analogue of (2.12) obtained in Section 4, we shall need
$\hat\zeta_{pn}$ to be strongly consistent. A suitable such estimator is developed
in Section 5.
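
As an illustration of (2.10) and (2.11), the sketch below turns a value of $\zeta_p$ (known or estimated) into integer ranks. The function names are hypothetical, and the floor/ceiling rounding and the clamping to $[1, n_{(m)}]$ are assumptions of the sketch; they change the ranks by at most one, which is absorbed by the $o_p(n^{-1/2})$ terms.

```python
import math
from statistics import NormalDist


def falling_factorial(n, m):
    """n_(m) = n (n-1) ... (n-m+1), the number of ordered m-tuples."""
    return math.perm(n, m)


def rank_functions(n, m, p, alpha, zeta):
    """Integer ranks a(n), b(n) in the spirit of (2.10)-(2.11).

    Rounding outward (floor for a, ceil for b) is an assumed convention.
    """
    z = NormalDist().inv_cdf(1.0 - alpha)
    n_m = falling_factorial(n, m)
    half = z * m * math.sqrt(zeta) / math.sqrt(n)
    a = max(1, math.floor(n_m * (p - half)))
    b = min(n_m, math.ceil(n_m * (p + half)))
    return a, b


# Classical median case: h(x) = x, m = 1, zeta_p = p(1 - p) = 1/4.
a100, b100 = rank_functions(n=100, m=1, p=0.5, alpha=0.025, zeta=0.25)
```

For $n = 100$ and $\alpha = 0.025$ this gives the 40th and 60th order statistics, the familiar normal-approximation ranks for a nominal 95% interval for the median.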

For certain special cases of kernel $h$, the "parameter" $\zeta_p$ does
not depend on $F$. For example, in the case $h(x) = x$ we have
$\zeta_p = p(1-p)$; in the case $h(x_1,x_2) = (x_1 + x_2)/2$ and $p = 1/2$, we have
$\zeta_p = 1/12$. (Thus Geertsema (1970) did not have to deal with random
versions of the functions $a(n), b(n)$.) In general, however, $\zeta_p$ depends
upon $F$. For example, in the case $h(x_1,\ldots,x_m) = (x_1 + \cdots + x_m)/m$, and
$F(x) = F_0(x - \xi_{1/2})$, and $m \ge 2$, we have

$\zeta_{1/2}(F) = \mathrm{Var}_F\{F_0^{*(m-1)}(X)\}$,

where $F_0^{*(k)}$ denotes the k-th order convolution of $F_0$. For $m \ge 3$, this
parameter may be seen to depend upon $F$.


3. Generalized order statistics and Bahadur representation theory.
We consider the order statistics $W_{n,k}$ defined by (1.1). Our first
result provides an a.s. error bound for $W_{n,k_n}$ as an estimator of
$\xi_p = H_F^{-1}(p)$, when $k_n/n_{(m)}$ converges to $p$ at a suitably fast a.s. rate.
(This generalizes and sharpens Lemma 2.5.4C of Serfling (1980), given
for the classical order statistics.)

LEMMA 3.1. Let $0 < p < 1$. Suppose that $H_F$ is differentiable at
$\xi_p$ with $H_F'(\xi_p) = h_F(\xi_p) > 0$. Let $\{k_n\}$ be a sequence of positive
integer-valued r.v.'s ($1 \le k_n \le n_{(m)}$) such that

(3.1)  $k_n/n_{(m)} - p = o(\varepsilon_n)$,  $n \to \infty$,  a.s.,

where $\{\varepsilon_n\}$ is a sequence of constants tending to 0 with

(3.2)  $\varepsilon_n^2\, n\, (\log n)^{-1} h_F^2(\xi_p)/m \ge c_0 > 1$,  all n sufficiently large.

Then a.s.

(3.3)  $|W_{n,k_n} - \xi_p| \le \varepsilon_n$,  all n sufficiently large.


PROOF. We must show that a.s.

(3.4)  $W_{n,k_n} \le \xi_p + \varepsilon_n$,  all n sufficiently large,

and

(3.5)  $W_{n,k_n} \ge \xi_p - \varepsilon_n$,  all n sufficiently large.

We shall prove (3.4), the proof of (3.5) being similar. Note that
(3.4) is equivalent to

(3.6)  $H_n(\xi_p + \varepsilon_n) - H_F(\xi_p + \varepsilon_n) \ge \dfrac{k_n}{n_{(m)}} - H_F(\xi_p + \varepsilon_n)$,  all large n.

Now, by application of a probability inequality of Hoeffding (1963)
(or see Serfling (1980), p. 201), we have

(3.7)  $P\{H_n(\xi_p + \varepsilon_n) - H_F(\xi_p + \varepsilon_n) \le t\} \le e^{-2[n/m]t^2}$,  $t < 0$, $n \ge m$.

For $t = -2^{-1/2} h_F(\xi_p)\varepsilon_n$, the LHS of (3.7) is seen to be $O(n^{-c_0})$, whence
by the Borel-Cantelli Lemma we have that a.s.

(3.8)  $H_n(\xi_p + \varepsilon_n) - H_F(\xi_p + \varepsilon_n) \ge -2^{-1/2} h_F(\xi_p)\varepsilon_n$,  all large n.

On the other hand, by (3.1) and Young's form of Taylor's Theorem (e.g.,
see Serfling (1980), p. 45), we have a.s.

(3.9)  $\dfrac{k_n}{n_{(m)}} - H_F(\xi_p + \varepsilon_n) = p - H_F(\xi_p + \varepsilon_n) + o(\varepsilon_n)$
       $= -h_F(\xi_p)\varepsilon_n + o(\varepsilon_n)$
       $\le -2^{-1/2} h_F(\xi_p)\varepsilon_n$,  all large n.

Thus (3.6) holds a.s. []
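
The role of the factor $2^{-1/2}$ in the proof can be checked numerically. The sketch below evaluates the Hoeffding bound $e^{-2[n/m]t^2}$ of (3.7) at $t = -2^{-1/2} h_F(\xi_p)\varepsilon_n$, taking $\varepsilon_n = (c_0 \log n)^{1/2} m^{1/2} n^{-1/2}/h_F(\xi_p)$ as in the later display (4.7). The numerical values of $n$, $m$, $c_0$ and $h_F(\xi_p)$ are illustrative assumptions only.

```python
import math


def epsilon_n(n, m, c0, h_density):
    """epsilon_n = (c0 log n)^{1/2} m^{1/2} n^{-1/2} / h_F(xi_p), cf. (4.7)."""
    return math.sqrt(c0 * math.log(n) * m / n) / h_density


def hoeffding_bound(n, m, t):
    """Right-hand side of (3.7): exp(-2 [n/m] t^2), with [.] the integer part."""
    return math.exp(-2.0 * (n // m) * t * t)


n, m, c0, h_density = 1000, 2, 2.0, 1.0       # illustrative values only
eps = epsilon_n(n, m, c0, h_density)
t = -h_density * eps / math.sqrt(2.0)
bound = hoeffding_bound(n, m, t)              # equals n**(-c0) when m divides n
```

With these values the bound is $n^{-c_0} = 10^{-6}$, which is summable in $n$ whenever $c_0 > 1$; this is what the Borel-Cantelli step requires.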

Next we provide a modulus-of-continuity-type result for the
empirical process $H_n(\cdot) - H_F(\cdot)$. This strengthens an earlier version
given by Serfling and Thornton (1981) and also generalizes Lemma 2.5.4E
of Serfling (1980).

LEMMA 3.2. Let $0 < p < 1$ and put $\xi_p = H_F^{-1}(p)$. Suppose that
$H_F'$ is bounded in a neighborhood of $\xi_p$, with $H_F'(\xi_p) = h_F(\xi_p) > 0$.
Let $\{a_n\}$ be a sequence of constants tending to 0 with

(3.10)  $a_n n^{1/2} (\log n)^{-1/2} \ge A > 0$.

Put

(3.11)  $T_{pn} = \sup_{|y| \le a_n} |[H_n(\xi_p + y) - H_n(\xi_p)] - [H_F(\xi_p + y) - H_F(\xi_p)]|$.

Then a.s.

(3.12)  $T_{pn} = O(a_n^{1/2} n^{-1/2} (\log n)^{1/2})$,  $n \to \infty$.

PROOF. The argument used for Lemma 2.5.4E of Serfling (1980)
carries through, with the use of Theorem 5.6.1A of Serfling (1980)
in place of Lemma 2.5.4A of Serfling (1980), and the use of Lemma 3.1
(above) in place of Lemma 2.5.4C of Serfling (1980). []

The next result provides a Bahadur-type representation for
$W_{n,k_n}$.

THEOREM 3.1. Let $0 < p < 1$ and put $\xi_p = H_F^{-1}(p)$. Suppose
that $H_F$ is twice differentiable at $\xi_p$, with $H_F'(\xi_p) = h_F(\xi_p) > 0$.
Let $\{k_n\}$ be a sequence of positive integer-valued r.v.'s ($1 \le k_n \le n_{(m)}$)
satisfying (3.1) and (3.2). Then a.s.

(3.13)  $W_{n,k_n} = \xi_p + \dfrac{k_n/n_{(m)} - H_n(\xi_p)}{h_F(\xi_p)} + O(\max\{\varepsilon_n^2,\ \varepsilon_n^{1/2} n^{-1/2} (\log n)^{1/2}\})$,  $n \to \infty$.

PROOF. Under the assumptions of Theorem 3.1, Lemma 3.1 is
applicable, from which we have a.s.

(3.14)  $|W_{n,k_n} - \xi_p| \le \varepsilon_n$,  all n sufficiently large.

Since, by (3.2), $\varepsilon_n$ satisfies (3.10) for all n sufficiently large, Lemma 3.2
(with $\varepsilon_n$ in place of $a_n$) is applicable, whence using (3.14) we have a.s.

(3.15)  $[H_n(W_{n,k_n}) - H_n(\xi_p)] - [H_F(W_{n,k_n}) - H_F(\xi_p)] = O(\varepsilon_n^{1/2} n^{-1/2} (\log n)^{1/2})$,  $n \to \infty$.

Now, by Young's form of Taylor's Theorem (Serfling (1980), p. 45)
and (3.14) we write, a.s.,

(3.16)  $H_F(W_{n,k_n}) - H_F(\xi_p) = (W_{n,k_n} - \xi_p) h_F(\xi_p) + (W_{n,k_n} - \xi_p)^2 H_F''(\xi_p)/2 + o(\varepsilon_n^2)$,  $n \to \infty$.

Since $H_n(W_{n,k_n}) = k_n/n_{(m)}$ a.s., (3.14), (3.15) and (3.16) yield
(3.13). []
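
A quick numerical look at the representation in the classical special case $h(x) = x$, $m = 1$ may be helpful. The sketch below, under the assumptions of Uniform(0,1) data (so $\xi_{1/2} = 1/2$ and the density equals 1), the rank $k = \lceil np \rceil$, and an arbitrary seed, computes the remainder left after subtracting the linear term and compares it with the $n^{-1/2}$ scale of that term.

```python
import math
import random


def bahadur_remainder(sample, p, xi_p, density_at_xi):
    """Remainder R_n in W_{n,k} = xi_p + (k/n - F_n(xi_p)) / f(xi_p) + R_n
    for the classical case h(x) = x, m = 1, with k = ceil(n p)."""
    n = len(sample)
    ordered = sorted(sample)
    k = max(1, math.ceil(n * p))
    w_nk = ordered[k - 1]
    f_n = sum(1 for x in sample if x <= xi_p) / n       # empirical df at xi_p
    return w_nk - xi_p - (k / n - f_n) / density_at_xi


random.seed(0)                                           # arbitrary seed
n = 20000
sample = [random.random() for _ in range(n)]             # Uniform(0,1)
remainder = bahadur_remainder(sample, 0.5, 0.5, 1.0)
leading_scale = n ** -0.5                                # order of the linear term
```

In runs of this kind the remainder is typically an order of magnitude smaller than `leading_scale`, in line with the faster rate in (3.13); this is an empirical illustration, not a verification of the a.s. rate.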


This result extends the classical result of Bahadur (1966) (or
see Serfling (1980), p. 91) to the case of random $k_n$ and generalizes
to the $W_{n,k_n}$ as well. Also, the regularity condition of Bahadur
(1966) that $F''$ (in our general context, $H_F''$) be bounded in a
neighborhood of $\xi_p$ is slightly relaxed. Indeed, one can further relax this
regularity condition and obtain the following useful variant of
Theorem 3.1.

LEMMA 3.3. Let $0 < p < 1$ and put $\xi_p = H_F^{-1}(p)$. Suppose that
$H_F'$ is bounded in a neighborhood of $\xi_p$, with $H_F'(\xi_p) = h_F(\xi_p) > 0$. Let
$\{k_n\}$ be as in Theorem 3.1. Then a.s.

(3.17)  $W_{n,k_n} = \xi_p + \dfrac{k_n/n_{(m)} - H_n(\xi_p)}{h_F(\omega_n)} + O(\varepsilon_n^{1/2} n^{-1/2} (\log n)^{1/2})$,  $n \to \infty$,

where $\omega_n$ lies between $\xi_p$ and $W_{n,k_n}$.

(The proof is similar to that of Theorem 3.1.)


Lemma 3.3 will be used to advantage in Section 4. One can
obtain a further variant of Theorem 3.1, involving further relaxation
of the regularity condition on $H_F$ but yielding a slower rate in
(3.17) and only in probability instead of almost surely. This is
analogous to a variant of Bahadur's result given by Ghosh (1971).
However, for the results of Section 4, the version given by Lemma 3.3
is needed.


4. Sequential nonparametric fixed-width confidence intervals.
Let integers $a(n)$ and $b(n)$ be defined via the formulas of (2.10) and
(2.11) with $\zeta_p$ replaced by an estimator $\hat\zeta_{pn}$. Let $N$ ($= N(d)$) be the
smallest integer $n \ge n_0$ for which

(4.1)  $W_{n,b(n)} - W_{n,a(n)} \le 2d$,

for some specified $d > 0$ and $n_0 \ge 1$. We consider the sequential
2d-width confidence interval

(4.2)  $(W_{N,a(N)},\ W_{N,b(N)})$

for estimation of $\xi_p = H_F^{-1}(p)$. The key properties of this sequential
confidence interval procedure are given by the following result.
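
The stopping rule (4.1) can be sketched for the simplest case $h(x) = x$, $p = 1/2$, where $\zeta_p = 1/4$ is known and no estimator is needed. Everything named below (the function, the `draw` callback, the `max_n` guard, the floor/ceiling rounding of the ranks, the seed) is an assumption of this sketch rather than part of the procedure as stated.

```python
import bisect
import math
import random
from statistics import NormalDist


def sequential_median_interval(draw, d, alpha, n0=50, max_n=100000):
    """Stopping rule (4.1) for h(x) = x, p = 1/2, where zeta_p = 1/4 is known.

    `draw` is a zero-argument callable returning one new observation.
    """
    z = NormalDist().inv_cdf(1.0 - alpha)
    ordered = []
    while len(ordered) < max_n:
        bisect.insort(ordered, draw())                 # keep the sample sorted
        n = len(ordered)
        if n < n0:
            continue
        half = z * 0.5 / math.sqrt(n)                  # z * m * zeta^{1/2} / n^{1/2}
        a = max(1, math.floor(n * (0.5 - half)))
        b = min(n, math.ceil(n * (0.5 + half)))
        lo, hi = ordered[a - 1], ordered[b - 1]
        if hi - lo <= 2.0 * d:                         # width criterion (4.1)
            return n, lo, hi
    raise RuntimeError("max_n reached before the width criterion was met")


random.seed(1)                                         # arbitrary seed
N, lo, hi = sequential_median_interval(random.random, d=0.05, alpha=0.025)
limit_Nd2 = NormalDist().inv_cdf(0.975) ** 2 * 0.25    # limit of N d^2 when h_F = 1
```

For Uniform(0,1) data the density at the median is 1, so `limit_Nd2` is the value that $N d^2$ approaches as $d \to 0$ according to the theorem that follows; for a fixed $d$ such as 0.05 the realized $N d^2$ need only be roughly comparable.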


THEOREM 4.1. Suppose that $H_F' = h_F$ is positive and Lipschitz of
order $\Delta$ at $\xi_p$, for some $\Delta > 0$. Let $\hat\zeta_{pn}$ be a strongly consistent
estimator of $\zeta_p$. Then the sequential fixed-width confidence interval
procedure defined by (4.2), and the random sample size $N$ required by
this procedure, have the properties

(a) $N$ is well-defined for all $d > 0$, $N$ ($= N(d)$) is nondecreasing
as $d$ decreases to 0, $\lim_{d \to 0} N(d) = \infty$ a.s., and
$\lim_{d \to 0} E(N) = \infty$.

(b) $\lim_{d \to 0} N d^2 = [\Phi^{-1}(1-\alpha)]^2 m^2 \zeta_p / h_F^2(\xi_p)$ a.s.

(c) $\lim_{d \to 0} P\{W_{N,a(N)} \le \xi_p \le W_{N,b(N)}\} = 1 - 2\alpha$.

Under the additional assumption

(4.3)  $\sup_{n \ge n_0} E(\hat\zeta_{pn}^{\beta}) < \infty$ for some $\beta > 1$,

we have also

(d) $\lim_{d \to 0} E(N) d^2 = [\Phi^{-1}(1-\alpha)]^2 m^2 \zeta_p / h_F^2(\xi_p)$; and $E(N) < \infty$, $d > 0$.


We first establish three lemmas needed for the proof of this
theorem.

LEMMA 4.1. Let integers $a(n), b(n)$ be defined via the formulas
of (2.10), (2.11) with $\zeta_p$ replaced by a strongly consistent estimator
$\hat\zeta_{pn}$. Let $H_F$ satisfy the assumptions of Theorem 4.1. Then a.s.

(4.4)  $n^{1/2}(W_{n,b(n)} - W_{n,a(n)}) \to 2\Phi^{-1}(1-\alpha)\, m\, \zeta_p^{1/2} / h_F(\xi_p)$,  $n \to \infty$.

PROOF. By the strong consistency of $\hat\zeta_{pn}$, it follows that a.s.

(4.5)  $a(n)/n_{(m)} = p + o(\varepsilon_n)$,  $n \to \infty$,

(4.6)  $b(n)/n_{(m)} = p + o(\varepsilon_n)$,  $n \to \infty$,

where, with $c_0 > 1$,

(4.7)  $\varepsilon_n = (c_0 \log n)^{1/2} m^{1/2} n^{-1/2} / h_F(\xi_p)$.

Hence Lemma 3.3 applies twice, with $k_n$ given by $a(n)$ and $b(n)$, and
yields a.s.

(4.8)  $n^{1/2}(W_{n,b(n)} - W_{n,a(n)}) = n^{1/2}(p - H_n(\xi_p))\,(h_F^{-1}(\omega_{n2}) - h_F^{-1}(\omega_{n1}))$
       $+\ \Phi^{-1}(1-\alpha)\, m\, \hat\zeta_{pn}^{1/2}\,(h_F^{-1}(\omega_{n2}) + h_F^{-1}(\omega_{n1}))$
       $+\ O(\varepsilon_n^{1/2} (\log n)^{1/2})$,  $n \to \infty$,

where $\omega_{n1}$ lies between $\xi_p$ and $W_{n,a(n)}$, and $\omega_{n2}$ between $\xi_p$ and $W_{n,b(n)}$.
By Lemma 3.1, and the Lipschitz assumption on $h_F$, we have a.s.

$h_F^{-1}(\omega_{n2}) - h_F^{-1}(\omega_{n1}) = O(\varepsilon_n^{\Delta}) = O(n^{-\Delta/2} (\log n)^{\Delta/2})$,  $n \to \infty$.

Now, by the law of iterated logarithm for U-statistics (Serfling (1980)),

$n^{1/2}[p - H_n(\xi_p)] = O((\log\log n)^{1/2})$ a.s.

Hence the first term on the right-hand side of (4.8) converges to 0 a.s. The
third term clearly converges to 0 also, and the second term converges a.s.
to the right-hand side of (4.4). []


LEMMA 4.2. For $N$ corresponding to (4.1), with $\hat\zeta_{pn}$ satisfying
(4.3), the r.v.'s $\{N d^2\}_{d>0}$ are uniformly integrable.

PROOF. By the proof of Lemma 3.2 of Bickel and Yahav (1968), it
suffices to prove

(4.9)  $\sum_{r=1}^{\infty} \sup_{0<d<d_0} P\{N d^2 > r\} < \infty$,

for some $d_0$. Let us write

(4.10)  $P\{N d^2 > r\} \le P\{N > [r/d^2]\} \le P\{W_{[r/d^2],\,b([r/d^2])} - W_{[r/d^2],\,a([r/d^2])} > 2d\}$.

By routine but tedious arguments (see Choudhury (1984), pp. 32-36, for
details), one can bound the right-hand side of (4.10) by a function of
$r$ and $d$ which is finitely summable in $r$ uniformly in $d < d_0$, for
sufficiently small $d_0$. []

LEMMA 4.3. Let $U_n$ be the U-statistic based on kernel $h(x_1,\ldots,x_m)$
and a random sample $X_1,\ldots,X_n$, $n \ge m$. Let $E h^2 < \infty$ and $\eta_1 > 0$,
where

(4.11)  $\eta_1 = \mathrm{Var}\{E\{\sum_P h(X_{i_1},\ldots,X_{i_m})/m! \mid X_1\}\}$,

and $\sum_P$ denotes summation over all permutations $(i_1,\ldots,i_m)$ of $(1,\ldots,m)$. Let
$\{N_\lambda\}$ be positive integer-valued r.v.'s and $\{a_\lambda\}$ positive constants
such that

(4.12)  $a_\lambda \to \infty$ as $\lambda \to \lambda_0$

and

(4.13)  $N_\lambda / a_\lambda \to c_0$ in probability, as $\lambda \to \lambda_0$,

with $c_0$ a finite constant. Then

(4.14)  $N_\lambda^{1/2}(U_{N_\lambda} - Eh) \xrightarrow{d} N(0, m^2 \eta_1)$ as $\lambda \to \lambda_0$.

PROOF. Put

(4.15)  $U_n = \hat U_n + R_n$,

where

$\hat U_n = \sum_{i=1}^{n} E\{U_n \mid X_i\} - (n-1)Eh$.

Then by Geertsema (1970) (or see Serfling (1980), p. 189), we have

(4.16)  $n^{1/2} R_n \xrightarrow{a.s.} 0$,  $n \to \infty$.

Let $\varepsilon > 0$ be given. Then

$P\{N_\lambda^{1/2}|R_{N_\lambda}| > \varepsilon\} \le P\{\sup_{k \ge c_0 a_\lambda/2} k^{1/2}|R_k| > \varepsilon\} + P(N_\lambda < c_0 a_\lambda/2)$.

As $\lambda \to \lambda_0$, the first term on the right tends to 0 by virtue of (4.12)
and (4.16), and the second term tends to 0 by (4.13). Hence we
obtain

(4.17)  $N_\lambda^{1/2} R_{N_\lambda} \xrightarrow{p} 0$, as $\lambda \to \lambda_0$.

Also, by the classical CLT, we have

(4.18)  $n^{1/2}(\hat U_n - Eh) \xrightarrow{d} N(0, m^2 \eta_1)$.

Thus, by the Doeblin-Anscombe CLT for randomly indexed sums (see
Chow and Teicher (1978), p. 317), we have

(4.19)  $N_\lambda^{1/2}(\hat U_{N_\lambda} - Eh) \xrightarrow{d} N(0, m^2 \eta_1)$, as $\lambda \to \lambda_0$.

Combining (4.17) and (4.19), the desired (4.14) follows. []


PROOF OF THEOREM 4.1. (a) By Lemma 4.1, i.e., by the convergence
(4.4), we can easily deduce that $N(d)$ is finite a.s., that $N(d)$ is
nondecreasing and tends to $\infty$ a.s. as $d \to 0$, and finally (by monotone
convergence) that $E N(d) \to \infty$ as $d \to 0$.

(b) Noting that

$W_{N,b(N)} - W_{N,a(N)} \le 2d < W_{N-1,b(N-1)} - W_{N-1,a(N-1)}$,

we have by the convergence $N \to \infty$ as $d \to 0$ and again the convergence
(4.4) that

(4.20)  $\lim_{d \to 0} N d^2 = [\Phi^{-1}(1-\alpha)]^2 m^2 \zeta_p / h_F^2(\xi_p)$ a.s.

(c) It is readily seen that

$\lim_{d \to 0} P\{W_{N,a(N)} \le \xi_p \le W_{N,b(N)}\} = \lim_{d \to 0} P\{N^{1/2}|H_N(\xi_p) - p| \le \Phi^{-1}(1-\alpha)\, m\, \hat\zeta_{pN}^{1/2}\}$,

and thus claim (c) follows via Lemma 4.3 applied to the U-statistic
$H_n(\xi_p)$ and Slutsky's Theorem.

(d) Since by Lemma 4.2 the r.v.'s $\{N d^2\}_{d>0}$ are uniformly integrable,
the convergence (4.20) holds also in expectation. Finally,
$E(N) < \infty$, $d > 0$, since the uniform integrability also implies
$\sup_{d>0} E(N d^2) < \infty$.

This completes the proof. []


REMARKS. (i) As noted previously, the parameter $\zeta_p$ is
distribution-free only in exceptional cases, so that in typical
applications a strongly consistent estimator is needed. Such an
estimator is provided in Section 5.

(ii) The asymptotic relative efficiency as $d \to 0$ of a sequential
fixed-width confidence interval $T$ relative to another such procedure
$S$ may be taken as

$e(T,S) = \lim_{d \to 0} E(N_S)/E(N_T)$.

For procedures satisfying Theorem 4.1, we obtain $e(T,S)$ immediately
from part (d) of the theorem. In particular, for the generalized
Hodges-Lehmann location estimators corresponding to $H_n^{-1}(1/2)$
for the kernel $h(x_1,\ldots,x_m) = (x_1 + \cdots + x_m)/m$, we have the formula:

(4.21)  $e(HL(m), S) = \dfrac{\sigma_F^2\, f_m^2(\xi_{1/2})}{m^2\, \zeta_{1/2}(F)}$,

where $f_m(\cdot)$ denotes the density function of the r.v. $\bar X_m = (X_1 + \cdots + X_m)/m$,
$\sigma_F^2$ denotes the variance of the df $F$, $\zeta_{1/2}(F) = \mathrm{Var}_F\{F_0^{*(m-1)}(X)\}$, and
$F(x) = F_0(x - \xi_{1/2})$ with $F_0$ symmetric about 0. Values of (4.21) for
choices of $F$ and $m = 1,2,\ldots,5$ are as follows.

F           m = 1    m = 2    m = 3    m = 4    m = 5
Normal       .637     .955     .981     .989     .993
Uniform      .333    1.000     .849     .906     .919
Logistic     .822    1.097    1.103    1.083    1.077
Laplace     2.000    1.500    1.321    1.238    1.190
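
The $m = 1$ and $m = 2$ columns of the table can be reproduced in closed form. The sketch below assumes that the comparison procedure $S$ satisfies $E(N_S) d^2 \to [\Phi^{-1}(1-\alpha)]^2 \sigma_F^2$ (as for a procedure based on the sample mean), so that the efficiency is the ratio $\sigma_F^2 f_m^2(\xi_{1/2}) / (m^2 \zeta_{1/2})$; this reading of $S$ is an assumption made here. It uses $\zeta_{1/2} = 1/4$ for $m = 1$ and $\zeta_{1/2} = 1/12$ for $m = 2$, and standard forms of each $F$ with the densities of $\bar X_1$ and $\bar X_2$ at the center worked out by hand.

```python
import math


def efficiency(sigma2, density_of_mean_at_center, m, zeta):
    """sigma^2 f_m(xi)^2 / (m^2 zeta): ratio of the two limits of E(N) d^2."""
    return sigma2 * density_of_mean_at_center ** 2 / (m ** 2 * zeta)


# (sigma^2, f_1 at the center, f_2 at the center) for standard forms of F.
families = {
    "Normal":   (1.0,              1 / math.sqrt(2 * math.pi), 1 / math.sqrt(math.pi)),
    "Uniform":  (1 / 12,           1.0,                        2.0),
    "Logistic": (math.pi ** 2 / 3, 0.25,                       1 / 3),
    "Laplace":  (2.0,              0.5,                        0.5),
}

table = {}
for name, (sigma2, f1, f2) in families.items():
    table[name] = (
        efficiency(sigma2, f1, 1, 0.25),       # m = 1: zeta = p(1 - p) = 1/4
        efficiency(sigma2, f2, 2, 1 / 12),     # m = 2: zeta = 1/12
    )
```

The resulting values are $2/\pi$ and $3/\pi$ for the normal, $1/3$ and $1$ for the uniform, $\pi^2/12$ and $\pi^2/9$ for the logistic, and $2$ and $3/2$ for the Laplace distribution, matching the first two columns to three decimals. Columns $m \ge 3$ require numerical evaluation of the convolution $F_0^{*(m-1)}$ and are not attempted here.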

5. Estimation of the nuisance parameter $\zeta_p$. We note that $m^2 \zeta_p$
is the asymptotic variance parameter of the U-statistic based on the
kernel

(5.1)  $g(x_1,\ldots,x_m) = 1\{h(x_1,\ldots,x_m) \le H_F^{-1}(p)\}$.

Sen (1981), §3.7, for example, gives methodology for construction of
strongly consistent estimators for the asymptotic variance parameters
of U-statistics. However, these methods assume that the kernel of
the U-statistic is completely known. In the present case, we have
in (5.1) a kernel involving an unknown parameter $\xi_p = H_F^{-1}(p)$.
Consequently, we develop an estimator by a different method.

Specifically, we introduce a family of kernels

(5.2)  $K(x_1,\ldots,x_{2m-1},\lambda)$,

indexed by $\lambda$, such that

$\zeta_p = E K(X_1,\ldots,X_{2m-1},\xi_p)$.

We denote by $U_n(\lambda)$ the U-statistic based on the kernel in (5.2).
Then a natural estimator of $\zeta_p$ is given by $U_n(\xi_p)$. Since, however,
$\xi_p$ is unknown, we substitute its estimator $\hat\xi_{pn} = H_n^{-1}(p)$, arriving
at the estimator

(5.3)  $\hat\zeta_{pn} = U_n(\hat\xi_{pn})$.
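
A minimal sketch of (5.3) for the pairwise-average kernel ($m = 2$) follows. It assumes the kernel form $K(x_1,x_2,x,y) = 1\{h(x_1,x) \le y\}\,1\{h(x_2,x) \le y\} - p^2$, which is what the general definition in the proof below reduces to for a symmetric $h$ with $m = 2$; the function name, the rank convention for $H_n^{-1}(p)$, the sample size and the seed are assumptions of the sketch. The triple sum is collapsed by counting, for each $k$, the number $c_k$ of indices $i \ne k$ with $h(X_i, X_k) \le \hat\xi_{pn}$, so that the ordered pairs $(i,j)$ contribute $c_k(c_k - 1)$.

```python
import math
import random


def zeta_hat_pairwise_mean(sample, p=0.5):
    """Sketch of (5.3) for the kernel h(x1, x2) = (x1 + x2)/2 (m = 2)."""
    n = len(sample)
    values = sorted((sample[i] + sample[j]) / 2
                    for i in range(n) for j in range(n) if i != j)
    xi_hat = values[max(1, math.ceil(p * len(values))) - 1]    # H_n^{-1}(p)

    total = 0
    for k in range(n):
        # c = number of i != k with h(X_i, X_k) <= xi_hat
        c = sum(1 for i in range(n)
                if i != k and (sample[i] + sample[k]) / 2 <= xi_hat)
        total += c * (c - 1)          # ordered pairs (i, j), i != j, both != k
    return total / (n * (n - 1) * (n - 2)) - p * p


random.seed(2)                                                 # arbitrary seed
zeta_hat = zeta_hat_pairwise_mean([random.random() for _ in range(300)])
```

For symmetric $F$ and $p = 1/2$ the target value is $\zeta_p = 1/12$, so the estimate from Uniform(0,1) data should land near 0.083.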


THEOREM 5.1. If $H_F$ is continuous at $\xi_p$, then a.s. $\hat\zeta_{pn} \to \zeta_p$, $n \to \infty$.


PROOF. Define

$J(x_1,\ldots,x_m,y) = (m!)^{-1} \sum_P 1\{h(x_{i_1},\ldots,x_{i_m}) \le y\}$,

where the sum is over permutations $(i_1,\ldots,i_m)$ of $(1,\ldots,m)$, and

$G(x,y) = \int\cdots\int J(x_1,\ldots,x_{m-1},x,y) \prod_{i=1}^{m-1} dF(x_i)$.

Then the parameter $\zeta_p$ may be expressed as

(5.4)  $\zeta_p = \mathrm{Var}_F\{G(X,\xi_p)\} = E_F\{G^2(X,\xi_p)\} - p^2$.

Defining

$\theta(y) = \int G^2(x,y)\,dF(x) - p^2$

and

$K(x_1,\ldots,x_{2m-2},x,y) = J(x_1,\ldots,x_{m-1},x,y)\, J(x_m,\ldots,x_{2m-2},x,y) - p^2$,

we have

$\int\cdots\int K(x_1,\ldots,x_{2m-1},y) \prod_{i=1}^{2m-1} dF(x_i) = \theta(y)$.

Noting that $K(x_1,\ldots,x_{2m-1},y)$ is monotone in the argument $y$, and that
this function is continuous at $y = \xi_p$ with probability 1 with respect
to the probability measure $\prod_{i=1}^{2m-1} dF(x_i)$, we obtain by the monotone
convergence theorem that the function $\theta(y)$ is continuous at $y = \xi_p$.

Now let $\varepsilon > 0$ be given and choose $\delta > 0$ such that $|\theta(y) - \theta(\xi_p)| < \varepsilon$
for $|y - \xi_p| \le \delta$. By the monotonicity of the kernel $K(x_1,\ldots,x_{2m-1},y)$
in the argument $y$, and by strong convergence of $\hat\xi_{pn}$ to $\xi_p$ (recall
(2.4)), we have that a.s.

(5.5)  $\hat\zeta_{pn} = U_n(\hat\xi_{pn}) \in [U_n(\xi_p - \delta),\ U_n(\xi_p + \delta)]$

for all n sufficiently large. By the almost sure convergence of
U-statistics, the interval in (5.5) is a.s. contained in the interval
$[\theta(\xi_p - \delta) - \varepsilon,\ \theta(\xi_p + \delta) + \varepsilon]$ for all n sufficiently large.
But this latter interval is contained in $[\theta(\xi_p) - 2\varepsilon,\ \theta(\xi_p) + 2\varepsilon]$,
i.e., in the interval $[\zeta_p - 2\varepsilon,\ \zeta_p + 2\varepsilon]$. []

P  P 

REMARK. The estimator $\hat\zeta_{pn}$ given above requires $O(n^{2m-1})$
computational steps. In the case that $O(n)$ computational ease is
desired, one can use instead the estimator

$\tilde\zeta_{pn} = [n/(2m-1)]^{-1} \{K(X_1,\ldots,X_{2m-1},\hat\xi_{pn}) + K(X_{2m},\ldots,X_{4m-2},\hat\xi_{pn}) + \cdots\}$,

which is also strongly consistent but less efficient than $\hat\zeta_{pn}$.
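
The block-average idea of this remark can be sketched for the pairwise-average kernel ($m = 2$), where each block holds $2m - 1 = 3$ consecutive observations. The kernel form $K(x_1,x_2,x,y) = 1\{h(x_1,x) \le y\}\,1\{h(x_2,x) \le y\} - p^2$, the normalization by the number of complete blocks, the function name and the seed are assumptions of this sketch. The threshold `y` is passed in; in practice it would be the estimate $\hat\xi_{pn}$.

```python
import random


def zeta_block_estimate(sample, y, p=0.5):
    """O(n) block average of K for h(x1, x2) = (x1 + x2)/2 (m = 2).

    Averages K over disjoint consecutive blocks of 2m - 1 = 3 observations;
    dividing by the number of blocks is an assumed normalization.
    """
    blocks = len(sample) // 3
    total = 0.0
    for b in range(blocks):
        x1, x2, x = sample[3 * b: 3 * b + 3]
        both = (x1 + x) / 2 <= y and (x2 + x) / 2 <= y
        total += (1.0 if both else 0.0) - p * p
    return total / blocks


random.seed(3)                                   # arbitrary seed
data = [random.random() for _ in range(6000)]
zeta_tilde = zeta_block_estimate(data, y=0.5)    # true xi_p used for illustration
```

Each block contributes an independent term, so the estimate is an ordinary sample mean; its variance is larger than that of the full U-statistic version, which is the loss of efficiency mentioned in the remark.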